Generating Sentences by Editing Prototypes
نویسندگان
چکیده
We propose a new generative model of sentences that first samples a prototype sentence from the training corpus and then edits it into a new sentence. Compared to traditional models that generate from scratch either left-toright or by first sampling a latent sentence vector, our prototype-then-edit model improves perplexity on language modeling and generates higher quality outputs according to human evaluation. Furthermore, the model gives rise to a latent edit vector that captures interpretable semantics such as sentence similarity and sentence-level analogies.
منابع مشابه
Weighting Prototypes . A New Editing Approach
It is well known that editing techniques can be applied to (large) sets of prototypes in order to bring the error rate of the Nearest Neighbour classifier close to the optimal Bayes risk. However, in practice, the behaviour of these techniques uses to be much worse than expected from the asymp-totic predictions. A novel editing technique is introduced here which explicitly aims at obtaining a g...
متن کاملWeighting Prototypes. A New Editing Approach
It is well known that editing techniques can be applied to (large) sets of prototypes in order to bring the error rate of the Nearest Neighbour classifier close to the optimal Bayes risk. However, in practice, the behaviour of these techniques uses to be much worse than expected from the asymp-totic predictions. A novel editing technique is introduced here which explicitly aims at obtaining a g...
متن کاملMonolingual Post-Editing by a Domain Expert is Highly Effective for Translation Triage
Various small-scale pilot studies have found that for at least some documents, monolingual target language speakers may be able to successfully post-edit machine translations. We begin by analyzing previously published post-editing data to ascertain the effect, if any, of original source language on post-editing quality. Schwartz et al. (2014) hypothesized that post-editing success may be more ...
متن کاملEditing Prototypes in the Finite Sample Size Case Using Alternative Neighbourhoods
The recently intro(hwed concept of Nearest Centroid Neight)orhood is applied to discard outlirrs and prototypes in cl,~s overlapping regions in order to improve the performance of the Nearest Neighbor rule through an etliting i)rocedure. This apl)roach is related to graph b~sed editing algorithms which also define alternatiw, neighborhoods in t[,rms of geometric relations. Cl,~si('al e([iting a...
متن کاملAdvancing Chimeric Antigen Receptor-Engineered T-Cell Immunotherapy Using Genome Editing Technologies: Challenges and Future Prospects
Chimeric antigen receptor engineered-T (CAR-T) cells also named as living drugs, have been recently known as a breakthrough technology and were applied as an adoptive immunotherapy against different types of cancer. They also attracted widespread interest because of the success of B-cell malignancy therapy achieved by anti-CD19 CAR-T cells. Current genetic toolbox enabled the synthesis of CARs ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1709.08878 شماره
صفحات -
تاریخ انتشار 2017